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Abstract 

We study the inhomogeneous continuum random trees (ICRT) that arise as weak limits 
of birthday trees. We give a description of the exploration process, a function defined on 
[0, 1] that encodes the structure of an ICRT, and also of its width process, determining 
the size of layers in order of height. These processes turn out to be transformations 
of bridges with exchangeable increments, which have already appeared in other ICRT 
related topics such as stochastic additive coalescence. The results rely on two different 
constructions of birthday trees from processes with exchangeable increments, on weak 
convergence arguments, and on general theory on continuum random trees. 
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1 Introduction 



This paper completes one circle of ideas (describing the inhomogeneous continuum random tree) 
while motivated by another (limits of non-uniform random p-mappings which are essentially 
different from the uniform case limit). Along the way, a curious extension of Jeulin's result on 
total local time for standard Brownian excursion will be established. 

Consider a continuous function / : [0, 1] — > [0, oo) which is an "excursion" in the sense 

/(0) = /(l) = 0; /(u)>0, 0<«<1. 
Use / to make [0, 1] into the pseudo-metric space with distance 

d(ui, u 2 ) := (f(ux) - inf f (u)) + (f (u 2 ) - mf /(it)), iti < u 2 . (1) 

Ul<U<U2 Ul<U<U2 

After taking the quotient by identifying points of [0, 1] that are at d-pseudo distance 0, this 
space is a tree in that between any two points there is a unique path; it carries a length measure 
induced by the distance d, and a mass measure, with unit total mass, induced from Lebesgue 
measure on [0,1]. An object with these properties can be abstracted as a continuum tree. 
Using a random excursion function yields a continuum random tree (CRT): Aldous [3 El- The 
construction of a continuum random tree T via a random function /, in this context called the 
exploration process of T (in Le Gall et al. "211 E], it is instead called height process while the 
term exploration process is used for a related measure- valued process), is not the only way of 
looking at a CRT; there are also 

(a) constructions via line-breaking schemes 

(b) descriptions via the spanning subtrees on k random points chosen according to mass measure 

(c) descriptions as weak or strong n — > oo limits of rescaled n-vertex discrete random trees. 
As discussed in [21 E] the fundamental example is the Brownian CRT, whose exploration process 
is twice standard Brownian excursion (this was implicit in Le Gall j2I~]), with line-breaking 
construction given in Aldous pQ, spanning subtree description in Aldous [3] and Le Gall (21], and 
weak limit (for conditional Galton- Watson trees) behavior in (21 El (see Marckert and Mokkadem 
[2*7] for recent review). A more general model, the inhomogeneous continuum random tree 
(ICRT) T 6 , arose in Camarri and Pitman as a weak limit in a certain model (p-trees) 
of discrete random trees. The definition and simplest description of T is via a line-breaking 
construction based on a Poisson point process in the plane (Aldous and Pitman ["""3 El), which 
we recall below. The spanning subtree description is set out in Aldous and Pitman [S] , and the 
main purpose of this paper is to complete the description of T e by determining its exploration 
process (Theorem [I]). 

1.1 Statement of results 

The parameter space © of the ICRT T e is defined [9 to consist of sequences = (0 O , 0%, 2 , . . .) 
such that 

(i) 6>o > 0; 0! > 2 > . . . > 0; 

(ii) = 

(iii) if £"i 0i<oo then O > 0. 
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We will often consider the finite-length subspace ©finite of © for which 9{ — Vi > /, for some 
/ > 0, calling / the length of 6. Note that £ ©finite can be specified by specifying a decreasing 

sequence . . . , 6 1 /) for which X][=i ^ 2 < 1) then set Q = ^/l — X]j>i ^ 2 > 0- 

Let {(C/j,T^),i > 1} be a Poisson measure on the first octant {(x, y) : < y < x}, with 
intensity 9% per unit area. For every i > 1 let also (£i,j, j > 1) be a Poisson process on the 
positive real line with intensity 9{ per unit length. The hypotheses on 6 entail that the set of 
points {Ui,i > > l,j > 2} is discrete and can be ordered as < r/i < r] 2 < . . ., we call 

them cutpoints. It is easy to see that r] k+1 — r] k — > as k — > oo. By convention let rj = 0. Given 
a cutpoint rj k , k > 1, we associate a corresponding joinpoint rjl as follows. If the cutpoint is of 
the form U{, then t]l = Vi. If it is of the form with j > 2, we let nf k = The hypothesis 
#o > or J2i>i 9i = oo implies that joinpoints are a.s. everywhere dense in (0, oo). 

The tree is then constructed as follows. Start with a branch [0,r/i], and recursively, given 
the tree is constructed at stage J, add the line segment by branching its left-end 

to the joinpoint rfj (notice that rfj < r\j a.s. so that the construction is indeed recursive as 
J increases). When all the branches are attached to their respective joinpoints, relabel the 
joinpoint corresponding to some £^1 as joinpoint j, and forget other labels (of the form 77^ or 
77*). We obtain a metric tree (possibly with marked vertices 1,2,...), whose completion we call 
T e . 

A heuristic description of the structure of the ICRT goes as follows. When 9q = 1 and 
hence 0* = for % > 1, the tree is the Brownian CRT, it has no marked vertex and it is a.s. 
binary, meaning that branchpoints have degree 3. It is the only ICRT for which the width 
process defined below is continuous, and for which no branchpoint has degree more than 3. 
When £ ©finite has length I > 1, the structure looks like that of the CRT, with infinitely 
many branchpoints with degree 3, but there exist also exactly / branchpoints with infinite 
degree which we call hubs, and these are precisely the marked vertices 1,2,...,/ corresponding 
to the joinpoints ^1,1, • • • , associated to the Poisson processes with intensities 9 1 ,...,9j 
defined above. The width process defined below has I jumps with respective sizes 8%, . . . , 9j, 
which occur at distinct times a.s. These jump-sizes can be interpreted as the local time of the 
different hubs - see remark following Theorem El When 6 £ ©finite, then the hubs become 
everywhere dense on the tree. Whether there exists branchpoints with degree 3 or not depends 
on whether 9 7^ or 9 = 0. Also, the tree can become unbounded. 

It turns out that the relevant exploration process is closely related to processes recently 
studied for slightly different purposes. The Brownian CRT in j7|, and then the ICRT in 0, 
were used by Aldous and Pitman to construct versions of the standard, and then the general, 
additive coalescent, and its dual fragmentation process, which are Markov processes on the 
state space A of sequences {(xi, x%, . . .) : X{ > 0, J2i x i = !}• m 13 the time-t state X(t) is 
specified as the vector of masses of tree- components in the forest obtained by randomly cutting 
the Brownian CRT or ICRT at some rate depending on t. Bertoin ^2] gave the following more 
direct construction. Let (i?° xc ,0 < s < 1) be standard Brownian excursion. For fixed t > 
consider the process of height-above-past-minimum of 

S s exc -ts, < s < 1. 

Then its vector of excursion lengths is A-valued, and this process (as t varies) can be identified 



3 



with the standard case of the additive coalescent. More generally, for 6 G © consider the 
"bridge" process 

oo 

6 B^ +£)0 i (% i < a} -s), 0<s<l 
1=1 

where (Ui) are independent random variables with uniform law on (0,1). Use the Vervaat 
transform - relocate the space-time origin to the location of the infimum - to define an "ex- 
cursion" process X° = (Xf, < s < 1) which has positive but not negative jumps. Bertoin 
[T3j used X to construct the general additive coalescent, and Miermont [2H] continued the 
study of fragmentation processes by this method. In this paper we use X e to construct a 
continuous excursion process Y e ; here is the essential idea. A jump of X at time tj defines 
an interval [tj,Tj] where Tj := inf{£ > tj : Xf = Xf }. Over that interval, replace Xf by 
Xfj_ +Xf — inf fj < u < s Xf. Do this for each jump, and let Y e be the resulting process. To write 
it in a more compact way, the formula 

Y = m\ inf Xf : < u < s 

I u<r<s 

holds, where m is Lebesgue's measure on K. Details are given in section |21 We can now state 
our main result. 

Theorem 1. Suppose G © satisfies < 00 ■ Then the exploration process of the ICRT 

T e is distributed as -mY . 

As will be recalled in Sect. the precise meaning of this theorem is: let U\, U%, ... be 
independent uniform variables on [0, 1], independent of Y , and as around ((TJ), replacing / by 
^■F , endow [0, 1] with a pseudo-distance d, so that the natural quotient gives a tree T 29 " Y 

where Y = Y° . Then for every J G N, the subtree spanned by the root (the class of 0) 
and the (classes of) U\,...,Uj has the same law as the tree Tf obtained by performing the 
stick-breaking construction until the J-th step. Since (Ui,i > 1) is a.s. dense in [0,1] and by 
uniqueness of the metric completion, T e and T 2e ° Y indeed encode the same random topological 
space. We also note that our proofs easily extend to showing that the hub with extra label i is 
associated to the class of tj or Tj, and this class is exactly {s G [U,Ti] : Yf = Y t 6 }. To avoid 
heavier notations, we will not take these extra labels into account from now on. 

When J2i @i = °°i the exploration process of the ICRT, if it exists, can be obtained as a 
certain weak limit of processes of the form j^t^Y 6 " for approximating sequences 9 n G ©finite, 

and in particular, when 9q > one guesses that the exploration process of T will still be -wiY , 
but we will not concentrate on this in the present paper. 

Remark. Formula (J2J is inspired by the work of Duquesne and Le Gall [IZ|, in which continuum 
random trees ( "Levy trees" ) are built out of sample paths of Levy processes. Our work suggest 
that there are many similarities between ICRTs and Levy trees. In fact, Levy trees turn out to 
be "mixings" of ICRTs in an analogous way that Levy bridges are mixing of extremal bridges 
with exchangeable increments. This will be pursued elsewhere. 

In principle Theoremnshould be provable within the continuous-space context, but we do not 
see such a direct proof. Instead we use weak convergence arguments. As background, there are 
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many ways of coding discrete trees as walks. In particular, one can construct a Galton- Watson 
tree with offspring distribution £ in terms of an excursion of the discrete-time integer-valued 
random walk with step distribution £ — 1. In fact there are different ways to implement the 
same construction, which differ according to how one chooses to order vertices in the tree, and 
the two common choices are the depth-first and the breadth-first orders. In section |3] we give a 
construction of a random rz-vertex p-tree, based on using n i.i.d. uniform(0, 1) random variables 
to define an excursion- type function with drift rate —1 and with n upward jumps, and again 
there are two ways to implement the construction depending on choice of vertex order. These 
constructions seem similar in spirit to, but not exactly the same as, those used in the server 
system construction in or the parking process construction in Chassaing and Louchard 
[To] . When 6 6 ©finite, by analyzing asymptotics of the (appropriately rescaled) discrete 
excursion using depth-first order, in the asymptotic regime where convergence to the ICRT 
holds, we get weak convergence to the process Y , and we show that this discrete excursion 
asymptotically agrees with Q\j1 times the discrete exploration process; we extend this to the 
case £\ 9i < oo by approximating the tree T e by the tree T e " associated to the truncated 
sequence . . . , 6 n , 0, . . .), and that is the proof of Theorem UJ It is a curious feature of the 
convergence of approximating p-trees to T that the rescaled discrete approximation process 
converges to %iY e for a topology which is weaker than the usual Skorokhod topology. In the 
course of proving Theorem [TJ we will give sufficient conditions for this stronger convergence to 
happen. 

For any continuum tree with mass measure /x, we can define 

W(h) = fi{x : ht(a;) < h}, h>0 

where the height ht(x) of point x is just its distance to the root. If W(h) = W(y) dy, h > 
then W(y) is the "width" or "height profile" of the tree (analogous to the size of a particular 
generation in a branching process model). The time-changed function (W(W~ l (u)), < u < 1) 
can be roughly interpreted as the width of the layer of the tree containing vertex u, where 
vertices are labelled by [0, 1] in breadth-first order. Parallel to (but simpler than) the proof 
of Theorem sketched above, we show that excursions coding p-trees using breadth-first order 
converge to X , and agree asymptotically with the height profile (sizes of successive generations) 
of the p-tree. In other words 

Theorem 2. Let 6 e 0. For the ICRT T the width process W(y) = W e (y) exists, and 

(W e ({W e y\u))^< u < 1) = (X (u),O < u < I). 

Qualitatively, in breadth-first traversal of the ICRT, when we encounter a hub at some 
< u < 1 we expect the time-changed width function W{W~ l (u)) to jump by an amount 
representing a "local time" measuring relative numbers of edges at that hub. Theorem |21 shows 
these jump amounts are precisely the ^-values of the hubs. 

When Yli 0i < oo, combining Theorems ^ and |21 gives a result whose statement does not 
involve trees: 
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Corollary 1. Let 6 £ satisfy ^2 i 9i < oo. TTie process -§iY has an occupation density 
(W (y),O <y< oo) satisfying 

(W 6 ({W e )-\u))^< u < 1) = (X»,0 < it < 1). 

Note that the "Lamperti-type" relation between W 6 and A e is easily inverted as 

(XU {y) ,y>O)^(W (y),y>O), (3) 

where 

L{t):= [ ^G[0,oo], < t < 1. 

JO A s 

This provides a generalization of the following result of Jeulin |T§] (see also Biane-Yor |14j). 

which from our viewpoint is the Brownian CRT case where 6 = 1. Let (l u ,0 < u < oo) be 
occupation density for (l^ xc ,0 < s < 1). Then 

(iVa. < w < oo) = < M < °°) 

where L(t) := J Q * ^|^ds. One might not have suspected a possible generalization of this identity 
to jump processes without the interpretation provided by the ICRT. 

Theorem El has the following other corollary: 
Corollary 2. For any 6 £ ; £/ie height sup v( - T e ht(v ) of the ICRT T has the same law as 

r 1 ds 

Jo x~r 

1.2 Discussion 

As formulated above, the purpose of this paper is to prove Theorems [T] and El concerning the 
ICRT. But we have further motivation. As ingredients of the proof, we take a known result 
(Proposition on weak convergence of random p-trees to the ICRT, and improve it to stronger 
and more informative versions (Propositions El and The Theorems and these ingredients will 
be used in a sequel [2] studying asymptotics of random p-mappings. By using Joyal's bijection 
between mappings and trees, one can in a sense reduce questions of convergence of p-mappings 
to convergence of random p-trees. In particular, under a uniform asymptotic negligibility hy- 
pothesis which implies that the exploration process of p-trees converges to Brownian excursion, 
one can use a "continuum Joyal functional" (which takes Brownian excursion to reflecting Brow- 
nian motion) to show that the exploration process of the random p-mappings converges to 
reflecting Brownian bridge. The results of the present paper give the limit exploration process 
Y for more general sequences of p-trees, and to deduce convergence of the associated random 
p-mappings we need to understand how the continuum Joyal functional acts on Y e . This is 
the subject of the sequel jH]. 
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2 Constructing X e and Y e 

Let 6 6 0, and consider a standard Brownian bridge B hr , and independent uniformly dis- 
tributed random variables (£/,, i > 1) in [0, 1], independent of B hr . Define 

oo 

x^ e = e B? + Hhu^t} -t), o < t < i. (4) 
i=i 

From Kallenberg j20j, the sum on the right converges a.s. uniformly on [0, 1]. Then X hr,e has 
exchangeable increments and infinite variation, and by Knight and Bertoin it attains 
its overall minimum at a unique location t min , which is a continuity point of X hr,e . Consider 
the Vervaat transform X e of X hr,e , defined by 

X° = X%f. -X?' , < t < 1, (5) 

where the addition is modulo 1. Then X e is an excursion- type process with infinite variation, 
and a countable number of upward jumps with magnitudes equal to (9i,i > 1). See Figured] 
Write tj = Uj — i m i n (mod. 1) for the location of the jump with size 9j in X°. 

For each j > 1 such that 9j > 0, write Tj = inf{s > tj : Xf = Xf _}, which exists because 
the process X has no negative jumps. Notice that if for some i ^ j one has tj e (t^Tj), then 
one also has Tj e (tj,Tj), so the intervals (ti,T,) are nested. Given a sample path of X , for 
< u < 1 and z > 1 such that 9i > 0, let 

Rf (u) = { 5 ~ K - \ U E % Tl] (6) 

1 [ else. 

If $i = then let Rf be the null process on [0, 1]. We then set 

Y° = X°-J2Rf, (7) 

i>l 

which is defined as the pointwise decreasing limit of ^-^k^,, Rf as n — > oo. See Figure|21 It 
is immediate that is a non-negative process on [0, 1]. More precisely, for any < u < s < 1 
and z such that it > U, Rf(u) is equal to the magnitude of the jump (if any) accomplished at 
time ti by the increasing process 

X e s {u) = inf X?, 0<u<s. 

u<r<s 



Since the Lebesgue measure of the range of an increasing function (/(s),0 < s < t) is 
f(t) — /(0) minus the sum of sizes of jumps accomplished by /, we obtain that 

Yf = m{X e s (u) : < u < s} < s < 1, (8) 

where m is Lebesgue measure. This easily implies that Y e is a continuous (possibly null) 
process, and since the largest jump of X e — J2i<i< n Rf ^ s ^n+i, which tends to as n — > oo, a 
variation of Dini's theorem implies that (JZJ) holds in the sense of uniform convergence. 
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Figure 1: A realization of (Xf,0 < s < 1) with / = 3 and (9 ,9 1 ,9 2 ,9 3 ) = 
(0.862,0.345,0.302,0.216) (/ = 3). The jumps are marked with dashed lines; the jump of 
height 9i occurs at time U. 
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Figure 2: The process (Yf, < s < 1) constructed from the process (Xf, < s < 1) in Figure 
[0 The "reflecting" portions of the path corresponding to jumps of X are marked by the 0j 
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The process Y is an excursion-type process on (0, 1). Moreover, since by classical properties 
of Brownian bridges the local infima of X hr,e are all distinct, the only local infima that Y e 
attains an infinite number of times are in the intervals [ti, Ti]. Let us record some other sample 
path properties of Y e . 

Lemma 1. Suppose 6 G © has length I G N U {oo} and 9 > 0. Almost surely, the values 
(X®_,i > 1) taken by X e at its jump times are not attained at local minima of X . Also, the 
times ti are a.s. not right-minima of X in the sense that there does not exist e > such that 



Proof. Let X^' (s) = X^ T,e — ^(Im^,] — s), which is independent of C/j. The shifted process 
X br ' e (- + t) — X^ r,0 (t) (with addition modulo 1) has same law as X br,e for every t, so the fact 
that 1 is not the time of a local extremum for X br,e and that |X br,0 (l — t)\/t — > oo as f - > 
(e.g. by j2H Theorem 2.2 (i)] and time-reversal) implies by adding back ^i(l{c/ i <-} — •) to X br ' e 
that Ui is a.s. not a local minimum of X hr,e . The statement about right-minima is obtained 
similarly, using the behavior of X br ' e at rather than 1. 

Next, since X hv,e is the sum of a Brownian bridge B hr and an independent process, the 
increments of X hr,e have continuous densities, as does the Brownian bridge (except of course 
the increment X hr ' e (l) - X hl > e (0) = a.s.). The probability that the minimum of X ' in any 
interval [a, b] with distinct rational bounds not containing Ui equals X^' is therefore 0. This 
finishes the proof. □ 

The following lemma will turn out to be useful at the end of the proof of Theorem 

Lemma 2. Let 6 G satisfy J2i ^ < °°> an d write n = (9 , 9i, . . . , 9 n ). Define X " as above, 
but where the sum defining X hr,e is truncated at n. Last, define Y e as in |2j) with X e instead 
of X 8 . Then Y e " converges a.s. uniformly to Y e as n — > oo. 

Proof. We want to estimate the uniform norm \\Y° n — Y \\, which by definition is — X° — 
^ i>1 (i?f" — Rf)\\ with obvious notations. The first problem is that X hT ' e " may not attain its 
overall infimum at the same time as X hr ' e , so that jump times for X e " and X e may not coincide 
anymore. So, rather than using X e " we consider X' n (s) = X hr ' 0n (s + t min ) — X br,£P (t m i n ) (with 
addition modulo 1) where t m { n is the time at which X hr,e attains its infimum. Then X' n — > X e 
uniformly. Define R' n i as in (JHJ) but for the process X' n and write Y' n = X' n — 'Yl l i<i<n ^nt- Notice 
that Y£ is just a slight space-time shift of Y 0n , so by continuity of Y e " and Y e it suffices to 
show that Y' n — > Y e uniformly. It is thus enough to show that || Ei<i<n(Ki — ^ as 

n — > oo. It is easy that for each i > 1, one has uniform convergence of R' ni to Rf. Therefore, 
it suffices to show that 



which is trivial because H-R^J < 9i, and Ylii^i < oo by hypothesis. 

Remark. Again, one guesses that the same result holds in the general 9 > case, so that the 
proof of Theorem n should extend to this case. However, the fact that m ight be infinite 

does not a priori prevent vanishing terms of the sum Xa<i<n-^n i t° accumulate, so the proof 
might become quite technical. 



Xf>X^forse[t t ,t t + e]. 
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Figure 3: A planar tree, with the two orderings of vertices as a,b,c,d,e,f,g,h 

3 Constructions of p-trees and associated excursion pro- 
cesses 

Write T n for the set of rooted trees t on vertex-set [n], where t is directed towards its root. Fix a 
probability distribution p = (pi, . . . ,p n )- Recall that associated with p is a certain distribution 
on T n , the p-tree 

P(T = t) = TTi^, d v in-degree of v in t. (9) 

V 

See jHI] for systematic discussion of the p-tree model. We shall define two maps ip v : [0, l) n — > 
T„ such that, if (X±, . . . ,X n ) are independent U(0, 1) then each ip p (Xi, . . . ,X n ) has the dis- 
tribution 0. The two definitions are quite similar, but the essential difference is that ^ readth 
uses a breadth-first construction whereas ^epth ugeg a d e ptJi.fi rs i construction. 

3.1 The breadth-first construction 

The construction is illustrated in Figure |H Fix distinct (xi, . . . ,x n ) G [0, l) n . Picture this as a 
configuration of particles on the circle of unit circumference, where particle i is at position Xi 
and has a "weight" Pi associated with it. Define 

FP( U ) = - U + J2pil {xi < u} , 0<u<l. (10) 

i 

There exists some particle v such that F p (x v — ) = mf u F p (u): assume the particle is unique. 
Let v — vi, v%, . . . , v n be the ordering of particles according to the natural ordering of positions 
x Vl < x V2 < . . . around the circumference of the circle. (In Figure |U we have V\ — 4 and the 
ordering is 4, 8, 2, 3, 7, 1, 5, 6). Write y(l) = x Vl and for 2 < j < n let y(j + l) = y(j)+p Vj m od 1. 
So y{n + l) = y{l) and the successive intervals [y(j), y(j + 1)], 1 < j < n are adjacent and cover 
the circle. We assert 

x Vj e[y(l),y(j)), 2<j<n. (11) 
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To argue by contradiction, suppose this fails first for j. Then [y(l),y(j)), interpreted mod 1, 
contains particles Vi, . . . , Vj_i only. Since y(j)—y(l) = p Vl + . . ■+p v _ 1 this implies F p (y(j)—) = 
F p (y(l)—), contradicting uniqueness of the minimum. 

We specify the tree ipp Tea,dth (xi, . . . , x n ) by: 
v i is the root 

the children of Vj are the particles v with x v G (y(j), y(j + 1)). 

By any child Vj. of Vj has k > j, so the graph cannot contain a cycle. If it were a forest and 
not a single tree, then the component containing the root v\ would consist of vertices v i, . . . , Vj 
for some j < n. Then the interval [y(l),y(j + 1)] would contain only the particles v i, . . . , Vj, 
contradicting (fTT]) for j + 1. 

Thus the construction does indeed give a tree. From the viewpoint of this construction it 
would be natural to regard the tree as planar (or ordered: the d v children of v are distinguished 
as first, second, etc) but we disregard order and view trees in T„ as unordered. 

Now consider the case where (x\, . . . , x n ) = (Xi, . . . , X n ) are independent U (0, 1). Fix an 
unordered tree t and write V\ for its root. Fix an arbitrary x Vl G (0, 1) and condition on 
X V1 = x Vl . Consider the chance that the construction yields the particular tree t. For this 
to happen, the particles corresponding to the d Vl children of v\ must fall into the interval 
[x Vl ,x Vl +p vi ], which has chance p Vl l . Inductively, for each vertex v an interval of length p v is 
specified and it is required that d v specified particles fall into that interval, which has chance 
p% v . So the conditional probability of constructing t is indeed the probability in and hence 
so is the unconditional probability. 

Remark. Note that in the argument above we do not start by conditioning on F p having its 
minimum at x vi , which would affect the distribution of the (Xi). 

We now derive an interpretation (jl3H4j) of the function F p at (|10p. which will be used in 
the asymptotic setting later. From now on we also suppose that for j > 2, y(j) is not a jump 
time for F p to avoid needing the distinction between F p (y(j)) and F p (y(j)—); this is obviously 
true a.s. when the jump times are independent uniform, which will be the relevant case. 

For 2 < j < n, vertex Vj has some parent v z u\, where 1 < z(j) < j. By induction on j, 

F p (y(j))-F p (y(l)-)= £ p Vi . 

i:i>j,z(i)<j 

In words, regarding t as ordered, the sum is over vertices i which are in the same generation 
as j but later than j; and over vertices i in the next generation whose parents are before j or 
are j itself. For h > 1, write t(h) for the number of vertices at height < h — 1. The identity 
above implies 

F p (y(t(h) + l))-F p (y(l)-)= p v . 

v:ht(v)=h 

Also by construction 

y(t(h) + 1) - y(l) mod 1 = P«" 

v.ht(v)<h-l 
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wt of gen wt of gen 1 wt of gen 2 wt of gen 3 



Figure 5: _F exc,p (-) codes the weights of successive generations (wt of gen) of the p-tree in Figure 

El 

We can rephrase the last two inequalities in terms of the "excursion" function 

F exc ' p (w) := F p (y(l) + u mod 1) - F p (y(l)-) 1 < u < 1 (12) 
and of u(h) := y(t(h) + 1) — y(l) mod 1. Then 

U (h) pv ( 13 ) 

v:ht(v)<h-l 

F e ^ p (u(h)) = Pv ( 14 ) 

v:ht(v)=h 

So the weights of successive generations are coded within F e * c,p (-), as illustrated in Figure El 
Note that to draw Figure we replace x,- t by 

x[ := Xi — y(l) mod 1. 

Remark. There is a queuing system interpretation to the breadth-first construction, which 
was pointed out to us by a referee. In this interpretation, the customer labelled i arrives at 
time x\ and requires a total service time pj. If customers are served according to the FIFO rule 
(first-in first-out) then F exc,p (u) is the remaining amount of time needed to serve the customers 
in line at time u. 

3.2 The depth-first construction 

The construction is illustrated in FigurelHl using the same (xi) and (pi) as before, and hence the 
same F p (u). In the previous construction we "examined" particles in the order Vi, t>2, . . . , v n ; 
we defined y(l) = V\ and inductively 
• y(j + 1) = y(j) + Pvj mod 1 



14 



• the children of Vj are the particles v with x v G (y(j), y(j + 1)). 

In the present construction we shall examine particles in a different order Wi, W2, ■ ■ ■ , w n and 
use different y'(j) to specify the intervals which determine the offspring of a parent. Start as 
before with W\ = V\ and y'(l) = x wi . Inductively set 

• y'U + 1) = y'U) + Pwj mod 1 

• the children of Wj are the particles v with x v G (y'(j),y'(j + 1))- 

• Wj + i is 

the first child of Wj, if any; else 

the next unexamined child of parent (wj), if any; else 

the next unexamined child of parent (parent (wj)), if any; else 

and so on. 

Here "unexamined" means "not one of Wi, . . . , wf and "next" uses the natural order of children 
of the same parent. 

Figure l^and the following paragraph talk through the construction in a particular example, 
using the same (x») and (p$) as in Figure |U Checking that ?/>p epth (Xi, . . . ,X n ) has distribution 
0, i.e. is a random p-tree, uses exactly the same argument as before. 

As in Figure EJ the root of the tree is vertex 4 (wi = 4), and we set y'(l) = £4. As 
before, y'(2) = y'iX) + P4, and the children of the root are the vertices {8, 2, 3} for which x v G 
(y'(l),y'(2)). As before, we next examine the first child w 2 = 8 of the root, set y'(3) = y'(2)+p 8 , 
and let the children of 8 be the vertices {7, 1} for which x v G (y'(2),y'(3)). At this stage the 
constructions differ. We next examine vertex 7, being the first child of vertex 8, by setting 
y'(4) = 2/'(3) + P7] the children of vertex 8 are the vertices v with x v G (y'(3), y'(4)), and it 
turns out there are no such vertices. We continue examining vertices in the depth-first order 
4,8,7,1,5,2,6,3. 

As with the breadth-first construction, the point of the depth-first construction is that the 
excursion function F cxc,p (-) tells us something about the distribution of the tree. For each 
vertex v of ^p epth (xi, . . . , x n ) there is a path root = y , yi, . . . , yj = v from the root to v. For 
each < i < j the vertex y i+ i is a child of vertex yi\ let y^i, ^2, • • • be the later children of y^ 
and let y^i, yj^, ... be all children of v. Write J\f(v) = Uo<j<j{?/i,i, yi } 2, ■ ■ •}• 

In the w-scale of F exc,p (w), we finish "examining" vertex Wi at time y*(i) := y'(i) — y'(l)- 
For vertex v = Wi set e(v) = y*{i). Then the relevant property of _F exc,p is 

F exc *(e(v)) = J2 P*» Vv - ( 15 ) 

See Figure [7| for illustration. As before, in Figure [7| the position of the jump of height Pi is 
moved from Xi to x\ := Xi — y'(l) mod 1. At first sight, relation (|T3|l may not look useful. But 
we shall see in section 16.21 that in the asymptotic regime the right side of (fTB^l can be related 

ancestor of vP w wn i cn i n turn relates to the height of v. 
Remark. We might alternatively have defined the tree ^p epth (a;i, . . . , x n ) in a way that would 
have been less suited for the forthcoming analysis, but which is worth mentioning. It is based 
on the LIFO-queuing system construction of Galton- Watson trees in Le Gall-Le Jan which 
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Figure 6: The construction of the tree ^p 6pth (xi, . . . , x n ) 
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Figure 7: Relation (fTo"]) in the depth-first construction 
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we sketch here. Imagine vertex i is a customer in a line which requires a treatment time pi. 
The customer i arrives at time Xj and customers are treated according to the Last In First Out 
rule. After relocating the the time-origin is at the time when the minimum of the bridge F p is 
attained, the first customer in line will also be the last to get out. Then we say that vertex i 
is a parent of vertex j if customer j arrives in a time-interval when i was being treated. Notice 
that the tree thus defined is in general different from ipp epth (xi, . . . , x n ). 

It is easy to see, using induction and the same kind of arguments as above, that taking 
Xi, . . . , x n to be independent uniform random variables builds a p-tree (in order that i has k 
children, k uniform random variables must interrupt the service of i which takes total time pi, so 
this has probability pf). It is also easy that the order of customer arrivals (after relocating the 
time origin) corresponds to the depth-first order on the tree. In particular, the cyclic depth-first 
random order of vertices in a p-tree is the uniform cyclic order on the n vertices. 



4 Convergence of p-trees to the ICRT 

Here we review known results concerning convergence of p-trees to the ICRT, and spotlight 
what new results are required to prove Theorems Q and 121 

The general notion of exploration process of a continuum random tree can be reinter- 
preted as follows. Fix J > 1. Let (Uj, 1 < j < J) be independent [7(0, 1) r.v.s and let 
£/(i) < U(2) < ■ ■ ■ < U(j) be their order statistics. To an excursion- type process (H s , < s < 1) 
associate the random 2 J — 1-vector 

I Hu inf H s , Hu inf H s ,... } H v I . (16) 

V C ' U m <s<U (2) (2) U (2) <s<U (3) (J 7 

This specifies a random tree-with-edge-lengths, with J leaves, as follows. 

• The path from the root to the i'th leaf has length Hjj . 

• The paths from the root to the z'th leaf and from the root to the (i + l)'st leaf have their 
branchpoint at distance infc/ (i) < s <c/ {i+1) H s . 

Now label the i'th leaf as vertex i', where Uu\ = Ui>. Write the resulting tree as Tj 1 . Call this 
the sampling a function construction. 

On the other hand one can use a continuum random tree T to define a random tree-with- 
edge-lengths Tj as follows. 

• Take a realization of T. 

• From the mass measure on that realization, pick independently J points and label them as 
{1,2,. ..,J}. 

• Construct the spanning tree on those J points and the root; this is the realization of Tj. 
Call this the sampling a CRT construction. 

As discussed in detail in jS], the relationship 

the exploration process of T is distributed as (H s , < s < 1) 
is equivalent to 

Tj^Tf, VJ>1, 
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(the background hypotheses in [3| were rather different, assuming path-continuity for instance, 
but the ideas go through to our setting.) In our setting, there is an explicit description of the 
distribution of the spanning tree Tf derived from the ICRT T e (see [8J), so to prove Theorem 
[T]it is enough to verify 

for Y = Y° defined at (0). In principle one might verify (fTTj) directly, but this seems difficult 
even in the case J = 1. Instead we shall rely on weak convergence arguments, starting with 
the known Proposition ^] below. 

Consider a probability distribution p = (pi, . . . ,p n ) which is ranked: p 1 > p 2 > . . . > p n > 0. 
In the associated p-tree (JSJ), pick J vertices independently from distribution p, label them as 
[J] in order of pick, take the spanning tree on the root and these J vertices, regard each edge as 
having length 1, and then delete degree-2 vertices to form edges of positive integer length. Call 
the resulting random tree Sj. Define er(p) := \f*^~pj- Now consider a sequence p n = (p n i) of 
ranked probability distributions which satisfy 

lima(p n ) = 0; limp ni /er(p n ) = i( 1 < % < I; limp„ i /cr(p n ) = 0, % > I (18) 

n n n 

for some limit = (8 , . . . ,9j) G ©finite- For a tree t and a real constant a > define a Cg> t to 
be the tree obtained from t by multiplying edge-lengths by a. The following result summarizes 
Propositions 2, 3 and 5(b) of [9]. Recall Tf is obtained by sampling the ICRT T e . 

Proposition 1. For a sequence p = p n satisfying hltfy . as n — ► oo 

a(p)®S p J ^rf, J>1. 

The tree Sj may not be well-defined because two of the J sampled vertices may be the same; 
but part of Proposition ^ is that this probability tends to zero. 

Now consider the "bridge" process F p at (jlOj). where from now on the jump times x±, . . . ,x n 
are uniformly distributed independent random variables. Standard results going back to Kallen- 
berg [20J show that, under the asymptotic regime (jTSjl . 

(a-\p)F p {s), < s < 1) (X s br ' e , < s < 1), 

where X hT,e is defined at (jlj). It follows by an argument that can be found e.g. in (using the 
continuity of the bridge process at its minimum) that the associated excursion process F exc,p 
at (fT^j) satisfies 

(a-\p)F^(s), < s < 1) ± (Xf , < s < 1) (19) 

for X defined at ©. 

Recall from section El how (Yf) is constructed as a modification of (Xf). We next describe 
a parallel modification of F exc,p to construct a process G p . Given a realization of the p-tree 
obtained via the depth-first construction illustrated in Figure [7[ and given / > 0, let Bi C [n] 
be the set of vertices which are the child of some vertex i in from {1, . . . , /}. In the setting of 
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the depth-first construction of the p-tree from _F exc,p , illustrated in Figured for every vertex 



v G £>j, define 



p v (u) = < u < x\ 

= p v x\ < u < e(v) — p v 

= e(v) — u e{y ) — p v < u < e(v) 

= e(v) <u<l. 



(20) 



and then let 



and 



= (21) 

veBi 

I 

G*(u) = F^(u)-J2rf(u). (22) 

i=i 

We will show in section I^TTl that (fTU|) extends to 

Proposition 2. For a sequence p = p n satisfying with limit 6 = (8q, . . . ,6j), as n — ► oo 

(a-\p)G^(u), < u < 1) (Y (u), < w < 1) 

for Y e defined at Fty, for the Skorokhod topology. 

We finally come to the key issue; we want to show that G^(-) approximates the (discrete) 
exploration process. In the depth-first construction of the p-tree T from _F exc,p , we examine 
vertex Wi during (y*(i — l),y*(i)]. Define 

H p (u) := height of Wi'mT ; u E {y*{i - l),y*{i)]. (23) 

Roughly, we show that realizations of Y<r(p)if p (-) and of cr _1 (p)G' p (-) are close. Precisely, we 
will prove the following in section 16.21 

Proposition 3. Let G ©finite- There exists a sequence p = p„ satisfying with limit 6, 
such that as n — ► oo, 



sup 

ue[o,i] 



^0. 



The next result, Lemma El relates the exploration process H p at ()23|) to the spanning trees 
S p . This idea was used in ([Ej; proof of Proposition 7) but we say it more carefully here. 
Given u 1 G (0, 1) define, as in 



w 1 = Wi for i specified by u 1 G (y*(i — l),y*(i)]. 

Given < u 1 < u 2 < 1, define w 2 similarly, and let vertex b be the branchpoint of the paths 
from the root to vertices w 1 and w 2 . Distinguish two cases. 

Case (i): w 1 = w 2 or w 1 is an ancestor of w 2 . In this case b = w 1 and so trivially lit (6) = 
min u i< u < n 2 H p (u). 

Case (ii): otherwise, b is a strict ancestor of both w 1 and w 2 . In this case we assert 

ht(6) = min H p (u) - 1, 

u 1 <u<u 2 
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because vertex b appears, in the depth- first order, strictly before vertex w 1 . Then consider the 
set of vertices between w 1 and w 2 (inclusive) in the depth first order. This set contains the 
child w* of b which is an ancestor of w 2 or is w 2 itself, and ht(w*) = ht(fe) + 1. But the set 
cannot contain any vertex of lesser height. 

Now the length of the interval (y*(i — l),y*(i)] equals p Wi by construction. So if U 1 has 
uniform distribution on (0, 1) then the corresponding vertex W 1 at (J23|) has distribution p. 
Combining with the discussion above regarding branchpoint heights gives 

Lemma 3. Fix p, make the depth-first construction of a p-tree and define H p by \2'J\) . Fix J. 
Take U%, . . . ,Uj independent uniform (0, 1) and use them and H p to define a tree-with- edge- 
lengths Tf via the "sampling a function" construction below Mb\) . Then this tree agrees, up 
to perhaps changing heights of branchpoints by 1, with a tree distributed as the tree Sj defined 
above / TT5j) . 

4.1 Proof of Theorem [T] 

We now show how the ingredients above (of which, Proposition |21 and Proposition El remain to 
be proved later) are enough to prove Theorem ^ 

Let p = p„ satisfy (JTHJ) with limit 6 £ ©finite- Fix J and take independent Ui,...,Uj with 
uniform (0, 1) distribution. Proposition |21 implies that as n — ► oo 

a-\p) (g p (U {1) ) inf G P (s),G»(U {2) ) inf G?(«), . . . , G?(U {J) )) 

^(Y\U {1) ) inf Y\slY\U m ) inf Y°(s), . . . , Y°(U {J) )) . 
By making the particular choice of (p n ) used in Proposition El 

\9la{p) inf H p (s),H p (U {2) ) inf H»(s), . . . , H»(U {J) )) 

\ U (1) <s<U (2) U {2 )<s<U (3) J 

^(y%U {1) ) inf Y%s\Y%U m ) inf Y%s), . . . , Y (U (J) )) . (24) 
Appealing to Lemma El this implies 

§0 o V(p) ® s p j - rj 

where the right side denotes the tree-with-edge-lengths obtained from sampling the function 
Y , and where convergence is the natural notion of convergence of shapes and edge-lengths 
( [TU] sec. 2.1). Rescaling by a constant factor, 

But Proposition [T] showed 

cr(p) ® S» S Tf 
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where the right side is the random tree-with-edge-lengths obtained by sampling the ICRT T e . 
So we have established (fTTj) and thereby proved Theorem [T] in the case 6 G 0fi n i te - 

In the case J2i @i < oo, write n for the truncated sequence (9 , . . . ,9 n ,0, . . .), and recall 
from Lemma El that Y n = Y " converges uniformly to Y e . By previous considerations this 
entails 

2 j ~^ 2 j 

for every J > 1. On the other hand, we have proved that the left-hand term has the same 
law as c(6 n )~ l ® Tj ' where c(6 n ) = {^2 0<i<n 9f)~ l l 2 is the renormalization constant so 
that c(6 n )6 n G ©. It thus remain to show that this converges to T e . Plainly the term c(6 n ) 
converges to 1 and is unimportant. The result is then straightforward from the line-breaking 
construction of the ICRT: Tf can be build out of the first (at most) 2J points (cutpoints and 
their respective joinpoints) of the superimposition of infinitely many Poisson point processes on 
the line (0, oo). It is easily checked that taking only the superimposition of the n first Poisson 

c( Q n )Q n 

processes allows us to construct jointly a reduced tree with same law as c 

same probability space. So for n large the first 2 J points of both point processes coincide and 

we have actually c{6 n )~ l Tj^ e '° = Tf on this probability space. □ 

Remark. Theorem ^ essentially consists of an "identify the limit" problem, and that is why 
we are free to choose the approximating p n in Proposition El But having proved Theorem Q we 
can reverse the proof above to show that ()24|) holds true for any p satisfying (|18j) with limiting 
6 G 0finite- Indeed, the convergence in (}2~4"j) is equivalent to that of er(p) <g) Sj to Tf for every 
J. 



4.2 Skorokhod convergence of the discrete exploration process 

Suppose again that the ranked probability p satisfies ()18|) with limit 6 G ©finite with length I. 
As observed in JU] (Theorem 5 and Proposition 7), the convergence in (J2"^j) is equivalent to 
weak convergence of the rescaled exploration process to Y , but using a certain topology on 
function space which is weaker than the usual Skorokhod topology. As noted in ^U] Example 
28, assumption (J18)) is paradoxically not sufficient to ensure convergence in the usual Skorokhod 
topology; the obstacle in that example was the presence of exponentially many (in terms of 
l/cr(p)) exponentially small p- values. In this section we present some crude sufficient conditions 
(j25l26j) ; PropositionElwill be a natural consequence of the proof in section Id^ The hypotheses 
are as follows. 

First, we prevent very small p- values by making the assumption 

1/p* = o(exp(a/c(p))) for all a > (25) 

where 

p* := minpj. 

i 

Second, we will assume that most of the small p(-)-weights, as compared with the I first, 
are of order cr(p) 2 . Write p = (0, 0, . . . ,pi+i, ■ ■ ■ ,p n ) for the sequence obtained from p by 
truncating the first I terms. Let £ have distribution p on [n], and write for the r.v. p^. 
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We assume that there exists some r.v. < Q < oo such that the following "moment generating 
function" convergence holds: 

= E[exp\Q] < oo, (26) 

for every A in some neighborhood of 0. This implies that p(£)/°"(p) 2 ~^ Qi an d also that the 
moments of all order exist and converge to those of Q. 
Then we have 

Theorem 3. Suppose p satisfies KTty) with limit 6 G ©finite- Under extra hypotheses $£$£B), 

°(P)H P - %Y e (27) 

in the usual Skorokhod topology. 

Remark. The proof (section [fi.2|) rests upon applying the elementary large deviation inequality 
P(S > s) < e~~ Xs E exp(AS') to the independent sums involved in (|39I41|) . Hypothesis (|2Hj) 
is designed to make the application very easy; it could surely be replaced by much weaker 
assumptions, such as plain moment convergence conditions. 

We would also guess that the convergence in (}2Tj) also holds with H p replaced by more general 
exploration processes, and in particular the "classical" one, where each vertex v is visited during 
an interval of length 1/n instead of p v , or the Harris (or contour) walk on the tree (see e.g. 
[T7| Chapter 2]). We can easily verify the first guess. Consider the p-tree i[)p epth (Xi, . . . ,X n ) 
defined as in section I3~2l out of uniformly distributed independent r.v. Write W\, . . . , w n for the 
vertices in depth-first order, and let H n (t) be the height of the Wi for which i/n<t<(i + l)/n 
(and with the convention H n (l) = H n (l—)). 

Corollary 3. Suppose p satisfies IT^j) with limit 6 G ©finite- Under extra hypotheses t2ffl2h)) . 

a(p)H n S Ay* (28) 

in the usual Skorokhod topology. 




Proof. By the functional weak law of large numbers for sampling without replacement, we know 
that if 7r is a uniform random permutation of the n first integers, the fact that maxjp n j — > 
as n —>■ oo implies that if (S°(t),0 < t < 1) is the linear interpolation between points 

Kk<iP*(i))'® — i — n ) t nen su Po<t<i ~ A ~~ > in probability. Now by the 

remark at the end of Sect. 13 .2\ the cyclic order on vertices associated to the depth-first order 
is uniform, so with the above notation for % = wi, . . . , w n the linear interpolation S n between 
points ((i/n, J2 1<k<i p Wk ), < i < n) converges uniformly to the identity in probability, since it 
is a (random) cyclic permutation of a function distributed as S®. Noticing that H n = H p o S n , 
the result follows. □ 

The convergence of the Harris walk follows from this proposition by the arguments in [TT| 
Chapter 2.4]. 
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5 Height profile 



This section is devoted to the proof of Theorem |21 In this section, we do not assume that 6 G 
has finite length nor that #o > 0. 



5.1 Continuity of the cumulative height profile 

We first prove the following intermediate lemma. Recall that the cumulative height process of 
the T 9 is defined as W e (.) = fi {v G T e : ht(v) < .}, where \i B is the mass measure of T . 

Lemma 4. The cumulative height process W is continuous for a. a. realizations ofT . More- 
over, it has no flat interval, except its (possibly empty) final constancy interval, equal to 
[sup w6T0 ht(?;),oo). 

Proof of Lemma ^| Recall the recursive line-breaking construction of T e in the introduction, 
and the fact from [9 a that the tree constructed at stage J is distributed as the reduced tree 
Tf of Sect. El From this, we see that the leaves labelled 1,2,... are a.s. at pairwise different 
heights, meaning that the measure dW e has no atom. Moreover, if W e had a flat interval (other 
than the final constancy interval), this would mean that for some h < sup v£T e, no leaf picked 
according to the mass measure can have a height in say {h — e, h + e) for some e > 0. But let v 
be a vertex of T at height h. By the line-breaking construction, the fact that branches have 
size going to and the "dense" property of joinpoints, we can find a joinpoint rj* at a distance 
< e/2 of v and so that the corresponding branch has length rj < e/2. Since the leaves that are 
at the right-end of branches of the line-breaking construction are distributed as independent 
sampled leaves from the mass measure, this contradicts the above statement. □ 



5.2 Proof of Theorem [2] 



The reader can consult j22] for a similar treatment of convergence of the height profile of 
Galton- Watson trees to a time-changed excursion of a stable Levy process. 

Suppose that p = p n satisfies the asymptotic regime (jl8|) . Let T p be the p-tree, and T 
the limiting ICRT. Define W 9 as above and recall the notation u(h) in (jlHjl . For h > let 



W p {h) 



E 



Pv 



U 



h 



+ 1 



h 



k(p)J 



h > 



and W p (h) = u([h/a(p)]). Now let Ui, U 2 , ... be independent uniform(0, 1) random variables. 
The sequence ((W p )~ 1 (Uj), j > 1) has the law of the heights of an i.i.d. random sample of 
vertices of T p , chosen according to p, and the same holds for ((W )' 1 (Uj) , j > 1) and the 
tree T 8 , with the mass measure fi as common law. For J > 1 let Wj(h) be the associated 
empirical distribution of the first J terms, defined by 



W7(*) = 7l> 
J <=i 



L {0*»)- 1 Pi)<M 



1 J 

-j E 

j i=i 



<WP(ft)}> 
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and define W (h) in a similar way. 

By Proposition Q we have that the random Stieltjes measure dWf converges in law to dW 
as n — > oo for every J > 1. Moreover, the empirical measure of an i.i.d. J-sample of leaves 
distributed according to fi converges to fi , implying dW —> dW as J — > oo. Thus, for /i > 
and J n — > oo slowly enough, 

dW p ^ dW . 

•Jn 

Now let Fj(x) = J -1 5^i=i l{i/i<x} be the empirical distribution associated to the uniform 
variables U%, . . . ,Uj. Then sup ft>0 \Wf (h) — W p {h)\ < sup x . g [ 01 ] \Fj(x) — x\, which by the 
Glivenko-Cantelli Theorem converges to as J — > oo, and this convergence is uniform in n. 
Hence the random measure dW p converges in distribution to dW 9 for the weak topology on 
measures. Thanks to Lemma we may improve this to 

W p (-) 4 W e (-) 

where the convergence is weak convergence of processes for the topology of uniform convergence. 
It is then an elementary consequence of LemmaEJthat W P ((W P )^ 1 (-)) converges in law for the 
uniform convergence topology to the identity function on [0, 1]. 

Equation ()14j) can be rewritten as 

W p {h) = F exc ' p {W p {h)), h>0, (29) 

so the convergence in distribution of W p , the fact that its limit is strictly increasing and 
continuous, and (|T9*j) imply that the sequence of random processes (a(p) _1 W /p ) is tight. Thus, 
the pair (a(p)~ l W p , W p ) is tight, and up to extraction of a subsequence, we can suppose that 

(a(p)~ 1 W p ,W p ) — > (W, W') for some process W, and where W' has the same law as W . 

Suppose further by Skorokhod's embedding theorem that the convergence is almost-sure. By 
definition 

r h W p (u) 

/ — -^d M = W p {h-a{p)) + R{n,h) 
Jo CT (P) 

where R(n, h) < W p (h) — W p (h — cr(p)) goes to uniformly as n — > oo by continuity of the 
limiting W' . So necessarily, 



/ W{u)du = W' hl h>0 
Jo 



for every h > 0, so that the only possible limit W is the density of dW . Therefore, the height 
profile W of the ICRT exists and (a(p)- l W p ,W P ) (W ,W ). Looking back at flU we 
have 

a(p)- 1 H/ p ((iy p )- 1 (w)) = a(p)- 1 F cxc ' p (H/ p ((H/ p )- 1 ( M ))), < u < 1, 

so by the convergence of W P ((W P )~ 1 (-)) and we obtain convergence in distribution 

of the right-hand side to X . By the convergence in law of W p this finally implies that 

W 9 ((W°)-\-)) = X (-) and TheoremElis proved. □ 
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Proof of Corollary^ By the proof of LemmaEl the only constant interval of the width process 
of the ICRT is [sup v£T e ht(t>), oo). Thus the height of the tree, sup v€T e ht(t>), is the first point 

after which the width process remains constant. By (jSJ), this point has same law as J ds/Xf. 
□ 

6 The exploration process 

To shorten notation, for A C [n] we write p(A) for the quantity YljeAPr 
6.1 Proof of Proposition 2 

Let p satisfy (fTHj) for some limiting 6 Sfmitc, with length /. In this subsection we suppose that 
the p-tree T p is constructed from the process _F exc,p by the depth-first search construction of 
sectional Moreover, since we have (fTU|) the convergence in law a(p)~ l F cxc ' p —> X 6 , we suppose 

by Skorokhod's representation theorem that our probability space is such that the convergence 
holds almost surely. Recall that in the depth-first search construction of the p-tree out of the 
process F exc,p , the z-th examined vertex v = Wi is examined during an interval [e(v) — p v , e(v )), 
during which the labels of jumps of _F exc ' p determine the set B v of children of v. 

We begin with two useful observations. First, if v is a vertex of T p and if T p denotes the 
fringe subtree of T p rooted at v, that is, the subtree of descendents of v, then for every vertex 
w of T p one has 

F exc *(e(w)) > F cxc *(e(v)) - p(B v ). (30) 
To argue this, simply recall formula (fTB]) and notice that M{v) C Af(w) U B v . 

Second, notice that since maxj pj —>■ and the limiting process X is continuous except for 
a finite number / of upward jumps, we must necessarily have that a.s. as n — >• oo, 

rj n := max 

v<=[n] 

Lemma 5. Almost surely 

maxa(p)- 1 \ Pj - p(B j \ - 0. 

J€[n] 

Proof. As mentioned, for every vertex v G [n], 

F exc *(e(v)) - F c ^(e(v) - p v ) = p{B v ) - p, 
Consider the process F p ^ defined by 

F Pl (s)=F exc,p (s) _ Y,P-J{s> <} 

\<i<I 



inf (F 

ue[e(v)—p v ,e(v)) 



CXC,P ; 



— F 



exc,p. 



e(v 



-Pv)) 



ola 



(31) 
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where as above x\ is the time when _F exc ' p has its jump with size pi. Easily, er(p) 1 F P + converges 
in the Skorokhod space to the process X ^ defined by 



Ki<J 



where £, is the time when X jumps by This process is continuous, hence meoCjPj — > 
implies 

a(p) _1 max \F pl (e(v)) - F pi (e(v) -> 0. 

V 

Now the quantity F pJ -(e(t>)) — F p -l-(e(v) — p„) equals 

F exc ' p (e(v)) — F exc ' p (e(v) — p v ) - £ Rl^g^-p.,^)]} 

i<«</ 

= p(8„)-p„-p(£„n[/]) 

implying the lemma. □ 

Now, for v a non-root vertex of T p let /(f) be its parent. For i G [/] and n large enough, % 
is not the root (since the limiting X 6 does not begin with a jump), so f(i) exists. 

Lemma 6. Let i & I. Let A4(i) be the set of descendents of f(i) that come strictly before i in 
depth-first order. Suppose that f(i) [I] for n large enough. Then as n — > oo ; p{M.{i)) — > 
almost surely. 

Proof. A variation of ()30|) implies for any v G Ai(i) and n large that 

F^ p (e(v)) > F^ p {e{fm-p{B m \ [/]). (32) 

Indeed, it is clear that for n large the sets B v n [/] contain at most one element, otherwise 
the Skorokhod convergence o~(p 

yi F c X c,p _^ x e WQuld fail ag twQ 

or more upward jumps of 

non- negligible sizes could occur in an ultimately negligible interval. Moreover, for v G A4(i), it 
is clear that J\f(v) contains i, hence (I32j) . Thus 

^r^ CXC ' P(M) ^ FCXC ' P ( e (^^))) -*(%0 \ [I])~Vn, 

e(f(t))<u<e(f(i))+p(M(i)) 

with r/ n defined at (J3TJ) , since the vertices of Ji4(i) are visited during the interval [e(f(i)), e(f(i))+ 
p(M.{i))}. Since o"(p) _1 F exc ' p (e(/(2))) is easily seen to converge to Xf , by (pITj) . Lemma|S]and 
the fact that f{i) ^ [I] for n large, if p(M.{i)) did not converge to 0, by extracting along a 
subsequence we could find an interval [U,ti + e] with e > where > X , and this is a.s. 
impossible by Lemma H D 

The assumption that f(i) ^ [/] may look strange since it is intuitive that the child of some 
i G [I] is very unlikely to be in [J] for n large (e.g. by Theorem |2J). We actually have: 

Lemma 7. For every i G [I], almost surely, Bi PI [I] — /or n Zarge, and 

<7(p)- l p(fl() - ^, 
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Proof. By Lemma El it suffices to prove that a.s. for large n, Bi H [I] = 0. Suppose that there 
exist i, j G [I] such that j is the child of i in the p-tree infinitely often. Since / < oo, we may 
further suppose that f(i) ^ [I] by taking (up to extraction) the least such i in depth-first order. 
By definition, _F exc ' p has a jump with size i in the interval [e(f(i)) — pfu\,e(f(i))]. Moreover, 
it follows from the definition of M.{%) that e{i) — Pi = e(f(i)) + p(A4(i)). Since the vertex 
i is examined in the interval [e{i) — pi,e(i)} and p(M.{i)) — > by the preceding lemma, the 
fact that f(j) = i implies that the jumps with size pi and pj occur within a vanishing interval 
[e(f(i)) — PfU), Therefore, the Skorokhod convergence of cr(p) _1 F cxc ' p to X 9 would fail. 
□ 

Now recall the definition (|2~T|) of the processes rf used to build G p in section EJ and that x\ 
is the time when _F exc,p jumps by . 

Lemma 8. For every i G [I], as n — > oo ; we have 



*(p) 



-i 



inf F exc ' p (u) 

x'- <u<s 



,-iexc,p , 



rf(s) 







a.s. uniformly in s G [x^, e(i) +p(T i p )]. 



Proof. Let i G [I], and let Bi = {vi, v 2 , ■ ■ ■ , Vk} (with k = \Bi\) where v%, v 2 , ■ ■ ■ are in depth-first 
order. For 1 < j < k let also v'j be the last examined vertex of T p in depth-first order, that is, 
the predecessor of Vj + % if j < k. Then one has, for every 1 < j < k and w G T p 

F^(e(w))>F c ^(e(v,))-p(B V] ), 

as follows from f)30j) . Rewrite this as 

F exc ' p (eH) > F exc ' p (e(z)) - Pvr 

l<r<i-l 

and check that the right hand side equals F exc,p (e(v' i _ l )). In particular, we obtain 



inf F^ p (e(v)) - F cxc ' p (e«) + £ p Vr 

l<r<?-l 



< max p,, . 



Now check that for w a vertex of T p , one has rf (e(u;)) = J2j< r <kPv r - F° r s as m the statement 
of the lemma deduce, for n large (since Bi fl [/] = by Lemma |7j), 



inf F exc ' p (u) - F cxc ' p (e(i)) + p(Bi) - rf(u) 

u&[x'.,s] 



< 2 maxpj + Vn + r\ n 



where 



7]' = max \F exc ' p (u) - F exc ' p (e(i))\ 

x' t <u<e{i) 



which is o(a(p)) by LemmaEland the convergence cx(p) l F exc ' p — > X . We conclude, using the 
fact that a(p)- 1 F cxc ' p (a; / i -) which is equal to the limit of cr(p)- 1 (F exc ' p (e(z)) -p(Bi)), 

as follows from Lemmas El and [7| □ 
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Proof of Proposition® We prove that the process cr(p) _1 r p converges to the Rf of section|2in 
the Skorokhod topology, for every i. In view of Lemma |Hl and since by definition of one has 
Ti(u) = for u > e(z)+p(7^ p ), the only thing to do is to show that e(v' k ) = e{i)+p{Tf) converges 
to the Tj of section |21 Since e(v' h ) > inf{s > x\ : rf(s) = 0}, we obtain that liminf e(v' k ) > T^. 
Suppose £ = limsup e(v' k ) > 7$, and up to extraction suppose that £ is actually the limit of e(v' k ). 
From the fact that F exc ' p (e(^)) = F exc > p (e(i)) -p(Bi), hence a(p)- 1 F exc ' p (e«)) converges to 
Xf._ by Lemmas Hand [3 we would find £ > % with Xf = Xf._ and Xf > Xf._ for s £ [Ti,£], 
and this is almost surely impossible by Lemma ^ as Xf_ would be a local minimum of X e , 
attained at time Tj. 

Without extra argument we cannot conclude that the sum a(p)~ 1 (F cxc ' p — ^i=i r f) converges 
to X e — J2i=i ^Hi but this is nonetheless true for the following reason. The process Rf is 
continuous except for one jump at ti, and the process rf has precisely one jump with size p(Bi) 
at time x-, that is, at the same time as the jump of _F exc ' p with size pj. Together with Lemma 
we obtain the Skorokhod convergence o"(p) _1 G p — > Y e . □ 



6.2 Proof of Theorem El 

As above, we suppose that p is a ranked probability distribution satisfying (|18|) for some limiting 
6 with length /, and we suppose that the p-tree T p is obtained by the depth-first construction 
of section El out of the process _F exc,p . We are going to show the following result: 

Proposition 4. Under extra hypotheses \25&2b\) on (p^), as n — > oo 



max 



±0. 



We first show how Theorem El and Proposition 01 are easy consequences of Proposition 01 

Proof of Theorem^ Since cr(p) _1 G p converges uniformly in distribution to a continuous pro- 
cess, and since H p does not vary in the intervals [e(v) —p v , e(v )), the last displayed convergence 
extends to 



max 

«e[o,i] 



e -o^ElH p {u)-a{p)- 1 G p {u) ^0, (33) 



and then Proposition 121 implies Theorem El □ 

Proof of Proposition [3J For Proposition El we choose the following approximating sequence 
p( n+/ ) for e ©finite with length /. Given n, let z n = ^Ju/Oq, s n = n + z n ^2 1<iKI Q% and 

P* = *t if 1 < i < / . . 

p l = 1 - if 7 + 1 < i < n + L 1 } 

It is trivial to see that this sequence fulfills hypotheses (J25I26)) . Hence (J33|) is satisfied, and 
Proposition El is an immediate consequence. □ 

We now mention three consequences of hypotheses (|25l26j) that will be used later. First, 
notice that p* < \jn since p is a probability on [n], so implies 

n = o(exp(a/a(p))) for all a > 0. (35) 
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Second, implies convergence of all moments of p(£)/cr(p) 2 , and in particular 

E 



( Pit) 

Up) 2 



£p?Mp) 2 
i-£p 2 Mp) 2 

ie[j] 
fig = E{Q). 



(36) 



£^ [exp(AQ) - 1 - XQ] < oo. 



(37) 



Third, for every A in a neighborhood of 0, 

„(p) 2 £[exp(i)-l-M] 

i«f[J] 

Indeed, the left side can be rewritten as i£ f exp(^^i) — 1 — ^£^1 j ; where the function 

/(x) = (e Xx — 1 — Xx)/x is understood to equal its limit at 0. Since it is bounded in a 
neighborhood of and dominated by e Xx near oo, the convergence of this expectation is an easy 
consequence of (J2BJ. 

The first step in the proof of Proposition's to relate H(-) to another function £?(•) measuring 
"sum of small p- values along path to root". Let A(v) be the set of ancestors of v in the p-tree, 
and let 

g(v):=p(A(v)\[I}). (38) 
Lemma 9. Under extra hypotheses as n — > oo for fixed K > 



max |a(p)£ 2 ht(t,) - ^p)" 1 ^) | A 0. 

i>:ht(i>)<K/o-(p) 

Proof. Let V be a p-distributed random vertex. Fix e > 0. It is enough to prove that as 
n — ► oo 

P(|(7(p)0 o 2 ht(y)- ( 7(p)- 1 g(e(y))| >e, <r(p)ht(V) <#) =o(p.). 

Let £ have distribution p on [n] and let i > 1) be i.i.d. By the "birthday tree" construction 
of the p-tree Corollary 3] we have equality of joint distributions 

T-1 



1=1 



where 

T := min{j > 2 : = for some 1 < i < j} 
is the first repeat time in the sequence £j. So it is enough to prove 

T-1 
8=1 



>£, a(p)(T-2) <K = o(p,) 
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We may replace T — 2 by T — 1 and 6$ by E( -££L ) by the above remark. Rewriting in terms 



T-l 



we need to prove 



i=i 



>e/<r(p), T-l<K/a(p) 



o(p*)- 



Now we are dealing with a mean-zero random walk, and classical fluctuation inequalities (e.g. 
[T5] Exercise 1.8.9) reduce the problem to proving the fixed-time bound 



P 



E p&< 

l<i<K/a{p) 



> e/cr(p) = o(p*) 



(39) 



We now appeal to assumption (J26)) . which basically says that the sums in question behave as 
if the summands had distribution Q — 9$ not depending on n. More precisely, the elementary 
large deviation inequality applied to the probability in (|3*9^1 but without the absolute values 
implies that for any small A > 0, 

logP E ^ ^ + ^log(£(exp(Ap(£)))- 



i=l 



<r(p) <r(p) 



Assumption and the convergence of the expectation of allows us to rewrite the log 
term on the right as 



^(P) 



log£(exp(A(Q-0 o 2 ))) + 



Krj x (n) 



a(p)E(exp(X(Q - e 2 ))Y 



where r}\{n) — > as n — > oo for any fixed A. We now choose A small enough so that — Xe + 
KlogE(exp(X(Q — = — ^ < and we let n — > oo, obtaining the bound exp(— <5'/cr(p)), 
for some 5' > 0, for the probability in ([3*9^1 without absolute values, but the other side of the 
inequality is similar. Now assumption gives the desired bound (|3*9^l . □ 

The next, rather strange-looking lemma does most of the work in relating the processes G7 (•) 
and £/(•). 

Given a probability distribution p on [n] and given a subset Ac [n], let q be the probability 
distribution obtained by lumping the points A into a single point; that is, q\ = p(A) and the 
multiset {qi,i > 2} is the multiset {pi,i G" A}. We also let X be the set of "large" q-values, 
except q\. Precisely, X is such that the multisets {p v , v G [I] \ A} — {q v , v G X} are equal. Then 

Lemma 10. Suppose p = p 1 -™) satisfies the regime and extra hypotheses \25¥Ab}) . Let 
A = AW C [n] and define q as above. Define a random variable X = X(q) as follows. Take 
a q-tree, condition on vertex 1 being the root. Let B\ be the set of children of 1, and for each 
v G B\ toss two coins c\ and c^, c\ a fair coin and P(c2 = Heads) = p(A \ [I])/p(A), and set 

X := /]{q v ■ v G B\ \X, coins c\ and C2 land Heads}. 
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Suppose qi < Ka(p) and set q\ = p(A \ [I]). Then for fixed e > there exists 5 = 6(e, K) > 
with 

P(\X - > e«r(p)) < exp(-5Mp)) = o(l/n), 
where the o(l/n) is thus uniform over qi < Ka(p). 

Proof. Consider the random variable 

igAU[I] 

where the (Ui) are independent uniform(0, 1). The key relation is 

P(X e •) < ±-P{Y e •)• (40) 

This follows from the breadth-first construction of the p-trees. In that construction of a q-tree, 
vertices i are associated with uniform(0, 1) r.v.'s U- in such a way that, if vertex 1 happens to 
be the root, then the children v of 1 are the vertices v for which U v := U' v — U[ mod 1 falls 
within (0,gi). Thus, writing 

X' -.= J2iQv:veB 1 \l} 

y ■■= Yl p*%i<»} 

i<£AU[I] 



we have 
So 



X' = Y' on the event { vertex 1 is root }. 

P(X' e -I 1 is root) < J"( Y ' e -) r = f P(Y' e 
v 1 ; ~ P( 1 is root) 91 v 



The stated inequality (jlUj) follows by applying an independent Bernoulli(gi/(2gi)) thinning 
procedure to both sides. 

Now write c = qi/2 and let us study the centered version of Y: 

Y:= PiiMu^-c). (41) 

i£All[I] 

The elementary large deviation bound, applied to y/cr(p) 2 , is: for arbitrary A > 0, 

-Xe 



We calculate 



log P(Y > e«r(p)) < — ^ + log£exp(AY/cx(p) 2 ). 
ct (p) 



logEexp(Af/ ( r(p) 2 ) 

E {^ + log[l + c(e A ^ 2 -l)]} 



i£A\J[I] 



-ten 



\pi/cr(p) 2 _ i _ Xpj 
6 1 ^(P) 2 
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since the quantities we are summing are positive, and by (|37jl the bound is asymptotic to 
ca(p)~ 2 $(A) for 

$(A) := E± [exp(AQ) - 1 - XQ) . 

By hypothesis c := q~i/2 < Ko~(p), so ccr(p)~ 2 < ifer(p) -1 . So there is a constant Ci = C\{K) 
such that 

logP(r > ea{p)) < ^ (-As + d$(A)) . 

But $'(0) = and so $(A) = o(A) as A j 0, so the right side is strictly negative for small A > 0. 
So there exists 6% = S±(e, K) > such that 

P(y> e< 7(p))<exp(-«5iMp))- 
Since K — F = c^^ Au ^]9j < qi/2 we have established the one-sided inequality 

P(Y - \q t > sa(p)) < exp(-5 1 /a(p)). 

The other side of the inequality is similar except for this last step: we cannot bound so easily 
the quantity Y — Y . However, by (|T5j). 

i 

Pi = 1 - P(A U [/]) > 1 - q x - Y,Pi > 1 - C2^(p) 

i<£AU[I] i=l 

for some C*2 = C^ii") < oo. Thus K — K > c(l — C2<j{p)) and we can conclude as above by the 
existence of 5 2 = 5 2 (e,K) satisfying 

P{\q t -Y> ea{p)) < exp(-5 2 /a(p)). 

So, letting 5' = 6i A 5 2 , 

P(\Y - \q x \ > ea{p)) < 2ex P (-5' /a(p)). 

Now (P0|) and hypothesis (|23j) and its consequence (|33j) establish Lemma ITU1 (with any 5 < 5'). 
□ 

For the next lemma, recall the definition of J\f(v) around (|15|) and let J\f*(v) be the subset 
of vertices of M(v ) which are not in [/] and whose parent is not in [/] either. 

Lemma 11. Fix j G [n] and a subset A C [n] with j G A. Take a random p-tree and 
condition on A(j) = A. Let also vi,...,Vk be the children of j that are not in [I] and let 
C *U) = Ekkj; hPvt; where the hi 's are independent Bernoulli random variables with parameter 
1/2, independent of the p-tree. Define 

X*:=p{N*{j))-c*{j). 

Then X* is distributed as the random variable X in LemmaUIK 
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Proof. Order A as t>o, vi, ■ ■ • > j, arbitrarily except for ending with j. Let T* be the set of rooted 
trees on [n] with root v whose path to j is the path v , v i, . . . , j. Let T® be the set of rooted 
trees on [n] \ A U {©} with root ©. There is a natural map T* — > T®: "lump the vertices 
in A together into a single vertex ©". It is straightforward to check, from the combinatorial 
definition (see e.g. |2Tj) of p-tree, that this map takes the distribution of p-tree (conditioned 
to T*) into the distribution of a q-tree (conditioned on having root ©). Also, we have the 
extra constraint in X* that the parents of the vertices we are summing on are not in [I], but 
conditionally on the fact that v has some parent in A, it is easy that the parent is in [I] with 
probability p(A D [I])/p(A). This corresponds to the biased coin-tosses in Lemma ITU1 And the 
fair coin-tosses in Lemma ITU1 reflect the random ordering of branches used in defining the depth- 
first order, as can be seen from the definition in Section |3] (the set of children of any vertex is 
put in exchangeable random order). The only exception is on children of j itself, which are all 
in Af*(v), so the 6;'s are designated to artificially remove each of them with probability 1/2. 
This establishes the lemma. □ 

The importance of the lemma is explained by the following formula 

max \G^(e(v)) -p(J\f*(v)) - c*(v)\ = o(cx(p)) in probability. (42) 

V 

Since asymptotically we know that children of i £ [I] are not in [I] , and since by Lemmas 
and[3 

a(p)- WKB, \ 0, (43) 

so in particular max.,- cr(p) _1 c*( - 7') — > in probability with the notations above, this is a straight- 
forward consequence of Lemma [7| and 

Lemma 12. Suppose that no vertex i £ [/] has a child that is also in [I], then we have for 
every v 

G»(e(v))=pW*(v))- (Pi-P&i))- ( 44 ) 

i£Af(v)n[I] 

Proof. Recall by definition (|2*0|) of the processes pk that if k is a child of some i £ [I], pk(e(v)) = 
Pk whenever v is examined after the parent f(i) of i and strictly before k in depth- first order, 
and pk{e{v)) = otherwise. As a consequence of (|T5j). we thus have 

G?(e(u)) = P{N(V)) - Yl PkMe(f(i))<e(v)<e(k)}- 

ie[i],keBi 

A careful examination of this formula shows that a term in the sum on the right is not zero if 
either v has some ancestor i £ [I], or some ancestor of v has a child % £ [I] that is after v in 
depth-first order, and these situations are exclusive by the assumption that vertices of [I] do 
not have children in [I]. In the first case, the formula says that we remove all the p- values of 
children of i that are after v in depth-first order, in the second case, it says that we remove the 
p- values of all the children of i, implying (J44I) . □ 

Proof of Proposition^ Fix e > and consider arbitrary v £ [n]. Recall the definition of 
A(v),Q(v),Af*(v),c*(v). We assert, from Lemmas ITU1 and ITTI that for any K > there exists 
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5 = 6(e, K) with 

P (\p{M\v)) - c*(v) - \G{v)\ > ea(p)\A(v)) < exp(-5/a(p)) on {Q(v) < (K + 2e)a(p)}. 

(45) 

To argue (J43J) . note that conditioning on the set A = A(v) of vertices in the path from the 
root to v determines the value Q(v) := p(A(v) \ [I]) = qi say. Then Lemmas ITUl HP imply that 
the conditional distribution of p(J\f*(v)) — c*(v) has the distribution of X in Lemma ITUl The 
conclusion of Lemma now gives ()45|) . 

So for fixed K and arbitrary v G [n] 

P Qp(.Af») - <ty) " |<^)| > £(T (P)' ^ + 2 *Mp)) < exp(-5/a(p)) = o(l/n). 

Using Boole's inequality gives 

P (^jb(A/"*(f )) - c*(v) - ~G(v)\ > e for some v with ^y^(^) < K + 2ej = o(l). 

By we may replace p(J\f*(v )) — c*(i>) by (e(i> )) in the previous expression. We now use 
a slightly fussy truncation procedure. Imposing an extra constraint, 

P (jfamaxG?{e(v)) < K, ^\G»(e(v)) - \g{v)\ > e for some v 

w ith^g(v)<K + 2s)=o(l). (46) 

We claim that we can remove the restriction on v to get 

P (^maxGP(e(,)) < K,^\C?(e(v)) - \G{v)\ > e for some v) = o(l). (47) 

Indeed, if v has parent v' then Q{y) — Qiy') < maxi^jjp, = o(cx(p)). So if there exists a v with 
2^t^j G{v) > K + 2e then (for large n) there is an ancestor w with K + e < 2 J(p) G( w ) < K + 2e. 
But if the first event in (pltjj) occurs, one obviously cannot have cr(p) _1 |Gj (e(w)) — \Q{w)\ < e 
by definition of w. Thus the probability in (|47|) is bounded by twice the probability in ([46)1 . 
This establishes f)47p . Since Proposition |2] implies max„ G^(e(f )) is tight as n — > oo, (|47|) 
implies 

max^lG^))- |^)|^0. (48) 

Now let us show that the sequence (cr(p) max„ e [ n ] ht(v), n > 1) is tight. Fix e > and let 
K > such that 

p(^maxG?(e(<;))>if) < e/2, 

Then 

P f(r(p)maxht(t») > K + l) < e/2 + P (a(p) maxht(u) > if + 1, maxG?(eM) < if) , 

\ v J \ v a vP) v J 

but by the same kind of argument as above, if cr(p) max„ ht(f) > if, for n large there must 
exist some w with if + 1/2 < er(p)ht(w;) < if + 1. By Lemma Owe then have also if + 1/2 < 
o"(p) _1 ^(^) < if + 1 with high probability, so (pfHj) implies that the right-hand side in the last 
expression is < e/2 for n large. This being proved, Lemma H3 rewrites as max„ \<j(p)~ l Q(v) — 
er(p)#oht(i>)| = o(l) in probability, which together with (|4~K|) establishes the proposition. □ 
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7 Miscellaneous comments 



1. In principle Corollary Ogives a criterion for boundedness of T , but one would prefer to have 
a condition directly in terms of 0. Here are some steps in that direction. From |2l] Theorem 
1.1], the process X hr,e may be put in the form X^ r,e = X* + X 2 , s > 0, where X 1 is a Levy 
process on [0, oo) and X 2 has exchangeable increments on [0, 1] and in a certain sense behaves 
less wildly than X 1 . Precisely, X 1 has no drift, its Gaussian part is 8q and its Levy measure is 
A(dx) = J2i>i ^(dx), where S y (dx) is the Dirac mass at y. On the other hand, X 2 can be put 
in the form 

X 2 = -Xls + J2^(Ms<v l} ~ s) 



i>l 



for some square-summable random family (r^) and a sequence Vi of independent r.v.'s with 
uniform law (notice that X 1 and X 2 are by no means independent). Then, writing K xbr ,e = 
inf {c > : J2i>i ®t < °°} an d K x 2 = inf{c > : J2i>i T t < °°} we have that 

Kx* < — r > ( 49 ) 

which is what we mean by "behaving less wildly". It is therefore reasonable that the problem 
on the fmiteness of the integral J ds/Xf, which is a problem dealing with the behavior at 
the left of the overall minimum of X br,e , should be replaced by a problem on the Levy process 
X 1 as soon as one can show that the overall minimum of X hr,e is actually attained at a local 
minimum of X 1 , and such that locally X 2 is negligible compared to X 1 at this time. Since X 1 
has no negative jumps, the time-reversed process has no positive jumps, and such questions are 
addressed in Bertoin ^Tj and Millar [29J. Pushing the intuition one step further, by analogy 
with the standard criterion for non-extinction of continuous-state branching processes and the 
analogy of ICRT's and Levy trees mentioned above, we conjecture that f°° \& _1 (A)dA < oo is 
equivalent to the boundedness of T e , where \l/ is the Laplace exponent of X 1 : 

#(A) = 9 \ 2 /2 + ^(exp(-A^) - 1 + \6i). 

i>l 



2. As we mentioned before, a natural guess would be that the exploration process of T e in 
the general case #0 > is -3jY 6 '. It is more difficult to get an intuition of what the exploration 

process of T e should be in the cases when #0 — 0, when the Brownian part of X 6 vanishes. By 
the general theory of continuum random trees, it should be easy to prove that compactness of 
the tree is enough to obtain the existence of an exploration process for T , which is the weak 
limit of 2(0Q)~ 2 Y en for some n G — > pointwise with 9q > for every n. But this would not 
tell much about the look of this process. Another way would be to try to generalize local time 
methods used in but these do not seem to adapt so easily to bridges with exchangeable 
increments instead of Levy processes. 

Acknowledgment. Thanks to an anonymous referee for a careful reading of a former version 
of the paper. 



35 



References 



[1] D.J. Aldous: The continuum random tree I. Ann. Probab. 19, 1-28 (1991). MR1085326 

[2] D.J. Aldous: The continuum random tree II: an overview. In M.T. Barlow and N.H. 
Bingham, editors, Stochastic Analysis, pp. 23-70. Cambridge University Press (1991). 
MR1 166406 

[3] D.J. Aldous: The continuum random tree III. Ann. Probab. 21,248-289 (1993). 
IMR12072261 

[4] D.J. Aldous, G. Miermont and J. Pitman: Brownian Bridge Asymptotics for Random 
p-Mappings. Electr. J. Probab. 9, 37-56, f2004). lATticIel 

[5] D.J. Aldous, G. Miermont and J. Pitman: Weak Convergence of Random p- 
Mappings and the Exploration Process of the Inhomogeneous Continuum Random Tree. 
larXiv : math . PR/040 1 1 1 51 

[6] D.J. Aldous and J. Pitman: Brownian bridge asymptotics for random mappings. Random 
Structures Algorithms 5, 487-512 (1994). IMR1293075I 

[7] D.J. Aldous and J. Pitman: The standard additive coalescent. Ann. Probab 26, 1703-1726 
fl998) lMRT675063l 

[8] D.J. Aldous and J. Pitman: A family of random trees with random edge-lengths. Random 
Structures Algorithms 15 176-195 (1 999). IMRI 7043431 

[9] D.J. Aldous and J. Pitman: Inhomogeneous continuum random trees and the entrance 
boundary of the additive coalescent. Probab. Th. Rel. Fields 118, 455-482 (2000). 
IMR18083721 

[10] D.J. Aldous and J. Pitman: Invariance principles for non-uniform random mappings and 
trees. In V. Malyshev and A.M. Vershik, editors, Asymptotic Combinatorics with Ap- 
plications in Mathematical Physics, pp. 113-147. Kluwer Academic Publishers (2002). 

IMR19993581 

[11] J. Bertoin: On the local rate of growth of Levy processes with no positive jumps. Stoch. 
Proc. App. 55, 91-100 (1995) IMR1312150I 

[12] J. Bertoin: A fragmentation process connected to Brownian motion. Probab. Theory Relat. 
Fields 117, 289-301 (2000). IMRT771665I 

[13] J. Bertoin: Eternal additive coalescent and certain bridges with exchangeable increments. 
Ann. Probab. 29, 344-360 (2001). IMR1825153I 

[14] P. Biane and M. Yor: Sur la loi des temps locaux browniens pris en un temps exponentiel. 
In Seminaire de Probabilites XXII, pp. 454-466. Lecture Notes in Math. 1321, Springer 
(1988). MR960541 



36 



[15] M. Camarri and J. Pitman: Limit distributions and random trees derived from the birth- 
day problem with unequal probabilities. Electron. J. Probab. 5, Paper 2, 1-18 (2000). 
IMR17417741 

[16] P. Chassaing and G. Louchard: Phase transition for parking blocks, Brownian excursion 
and coalescence. Random Structures Algorithms 21, 76-119 (2002). MR1913079 

[17] T. Duquesne and J.-F. Le Gall: Random trees, Levy processes and spatial branching 
processes. Astensque 281 (2002). IMR1954248I 

[18] R. Durrett. Probability: theory and examples. Duxbury Press, Belmont, CA, second edition 
fl996). lMRT609153l 

[19] T. Jeulin: Application de la theorie du grossissement a l'etude des temps locaux browniens. 
In T. Jeulin and M. Yor (eds.): Grossissements de nitrations: exemples et applications. 
Lecture notes in Maths 1118, Springer, Berlin (1985). IMR884713I 

[20] O. Kallenberg: Canonical representations and convergence criteria for processes with in- 
terchangeable increments. Z. Wahrsch. Verw. Gebiete 27, 23-36 (1973). MR394842 

[21] O. Kallenberg: Path properties for processes with independent and interchangeable incre- 
ments. Z. Wahrsch. Verw. Gebiete 28, 257-271 (1974). MR402901I 

[22] G. Kersting: On the height profile of a conditioned Galton- Watson tree. Preprint (1998). 

[23] F. B. Knight: The uniform law for exchangeable and Levy process bridges. Hommage a P. 
A. Meyer et J. Neveu. Astensque 236, 171-188 (1 996). IMRI 41 79821 

[24] J.-F. Le Gall: The uniform tree in a Brownian excursion. Probab. Th. Rel. Fields 96, 
369-383 (1 9931. IMRT231 930I 

[25] J.-F. Le Gall: Brownian excursions, trees and measure-valued branching processes. Ann. 
Probab. 19. 1299-1439 (1991). IMR1127710I 

[26] J.-F. Le Gall and Y. Le Jan: Branching processes in Levy processes: The exploration 
process. Ann. Probab. 26, 213-252 (1998). IMR1617047I 

[27] J.-F. Marckert and A. Mokkadem: The depth-first processes of Galton- Watson trees con- 
verge to the same Brownian excursion. Ann. Probab. 31, 1655-1678 (2003). MR1989446 

[28] G. Miermont: Ordered additive coalescent and fragmentations associated to Levy processes 
with no positive jumps. Electr. J. Probab. 6, paper 14 (33 pages) (2001). Article MR1844511 

[29] P.W. Millar: Comparison theorems for sample function growth. Ann. Probab. 9, 330-334 
(1981). MR606997 

[30] C.A. O'Cinneide and A.V. Pokrovskii: Nonuniform random transformations. Ann. Appl. 
Probab. 10 (4), 1151-1181 (2000). MR1810869 



37 



[31] J. Pitman: Random mappings, forests and subsets associated with Abel-Cayley-Hurwitz 
multinomial expansions. Seminaire Lotharingien de Combinatoire, Issue 46, 45 pp. (2001). 
IArticlellMH.18776341 

[32] J. Pitman and M. Yor: Arcsine laws and interval partitions derived from a stable subor- 
dinator. Proc. London Math. Soc. (3) 65, 326-356 (1992). IMR1168191I 

[33] D. Revuz and M. Yor: Continuous martingales and Brownian motion. Springer, Berlin- 
Heidelberg, third edition (1999). IMR1725357I 

[34] W. Vervaat: A relation between Brownian bridge and Brownian excursion. Ann. Probab. 
7, 143-149 fl979). lMP3T5820l 



38 



